
[draft] new reader for curry files, using curry-python-reader code #13176


Draft · wants to merge 32 commits into main

Conversation

@dominikwelke (Contributor) commented Mar 26, 2025

hi all,
as discussed in #13033, here is a draft PR to use the official Curry reader code.

in a first step I just use the reader as a module (I only fixed formatting to force it past pre-commit). it has some drawbacks (e.g. data is always loaded) and I did not implement all possible data yet (e.g. HPI, or epoched recordings), but in general it already works pretty well. I tested it with all their example data, and with one of my own recordings that didn't work in MNE before.

it would be great to get some feedback on how you want me to proceed with this, @drammock @larsoner:

  • do we want to stick with the module approach, leaving their code untouched and working with its output (this would allow easier updating when they push changes)?
  • or should I merge the code more thoroughly, making it easier to maintain and clearer?

BACKGROUND:
the current Curry data reader can't handle all/newer Curry files. the plan is to port code from the official Curry reader into MNE-Python.

for permission, see #12855

closes #12795 #13033 #12855

@@ -0,0 +1,633 @@
# Authors: The MNE-Python contributors.
# License: BSD-3-Clause
# Copyright the MNE-Python contributors.
Member

is this file the official reader? is it copied from somewhere?

@dominikwelke (Contributor, Author) commented Mar 28, 2025

yes, it's copied from https://github.com/neuroscan/curry-python-reader

the false info you flagged was added by [autofix.ci]; further formatting changes were necessary to pacify the pre-commit hook

@dominikwelke (Contributor, Author) commented Mar 28, 2025

you discussed this topic with a Compumedics dev in #12855.

they said they won't supply a PyPI or conda version, but we are free to use the code. the GitHub repo has a BSD-3 license applied, but they don't include any note in the file itself

@drammock (Member)

do we want to stick with the module approach, leaving their code untouched and working with its output (would allow easier updating when they push changes), or should I merge the code more thoroughly, making it easier to maintain and clearer?

Given that @CurryKaiser refused our offer to help them package up their reader for PyPI / conda-forge, I see two remaining options:

  1. "vendor" their code. To make it slightly future-proof, we could write a script (in tools/ I guess) that fetches the latest code from their repo, auto-formats it to make it compatible with MNE's pre-commit requirements, and puts the (formatted but otherwise unmodified) code in mne/io/curry/_vendored.py. (This is basically a manual version of git submodule update because I don't think we should invoke git submodule for this use case.) We then adapt our code in mne/io/curry.py to be a wrapper around their code that basically just gets things into proper MNE container objects; and know that we might need to tweak our wrappers any time the vendored code is updated.

  2. Fully incorporate their code. Re-write their reader to align better with our codebase, in terms of variable names, idioms like _check_option or _validate_type, features like preload=False, etc.
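
For illustration, a minimal sketch of what such a fetch script could look like (the file name tools/vendor_curryreader.py, the upstream branch, and the upstream module name curryreader.py are assumptions, not verified paths):

```python
# Hypothetical tools/vendor_curryreader.py -- sketch of option 1's
# fetch-and-vendor idea; upstream branch/file names are assumptions.
from pathlib import Path
from urllib.request import urlopen

UPSTREAM = (
    "https://raw.githubusercontent.com/neuroscan/curry-python-reader/"
    "main/curryreader.py"
)
DEST = Path("mne/io/curry/_vendored.py")


def main():
    # Fetch the latest upstream reader and write it with a provenance note.
    code = urlopen(UPSTREAM).read().decode("utf-8")
    header = "# Vendored from neuroscan/curry-python-reader; do not edit.\n"
    DEST.write_text(header + code, encoding="utf-8")
    # Afterwards, run the project's formatters so pre-commit passes, e.g.
    # `pre-commit run --files mne/io/curry/_vendored.py`.


if __name__ == "__main__":
    main()
```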

Personally I lean toward option 2. I say this because if we're going to try to support curry files, at a minimum we need to be able to fix bugs when they arise, and ideally we should be willing/able to incorporate new features that have high demand from our users (preload=False is an obvious first example). But if we're fixing bugs, do we open PRs to upstream (with no guarantee of responsiveness), or tweak our "thin" wrapper to handle more and more corner cases? Neither option is appealing, so at that point it starts to seem easier to me to just maintain the entire reader ourselves.

@agramfort (Member) commented Mar 28, 2025 via email

@drammock (Member)

hum... my first reaction is to push a version 0.1 of their package on pypi and rely on this. Basically we maintain a fork and hope that the fork changes are accepted upstream... it feels less hacky and they also have a CI and some testing setup with test_data that I would not duplicate in mne-python...

that indeed is less hacky than my approach to vendoring. I'd be OK with that outcome, though curious what @larsoner will think.

@larsoner (Member)

I'm fine with that idea but it would be good to get some blessing/permission from them to do this

@drammock (Member)

@CurryKaiser

I'm fine with that idea but it would be good to get some blessing/permission from them to do this

xref to #12855 (comment) where I've asked for confirmation that Compumedics really doesn't want to be the packager and they're OK with us doing it.

@CurryKaiser

@CurryKaiser

I'm fine with that idea but it would be good to get some blessing/permission from them to do this

xref to #12855 (comment) where I've asked for confirmation that Compumedics really doesn't want to be the packager and they're OK with us doing it.

And nothing has changed, so all good from our side. Sorry we couldn't package it for you. And thank you for working on this!

@dominikwelke (Contributor, Author)

thanks @CurryKaiser !

ok, sounds like a plan... I can start working on this again soon, if you give the go-ahead @agramfort @drammock @larsoner

I guess the fork should live in the mne-tools org? I have the necessary rights to create it

@dominikwelke changed the title from "[draft] new reader for curry files, using curry-pyhon-reader code" to "[draft] new reader for curry files, using curry-python-reader code" on Apr 1, 2025
@larsoner (Member) commented Apr 1, 2025

Yeah I think so

@drammock (Member) commented Apr 1, 2025

Yeah I think so

I already made the fork

@agramfort (Member) commented Apr 1, 2025 via email

@drammock (Member) commented Apr 2, 2025

xref to mne-tools/curry-python-reader#1

@dominikwelke (Contributor, Author)

I could use some guidance on 2 things:

  1. channel locations:
    curry files come with channel locations, and for EEG it was straightforward to build a montage and apply it (see the sketch after this list).
    but for MEG it seems I need to use other functions. any pointers would help!
    do I need to populate info["dig"] directly?

  2. HPI/cHPI data:
    some MEG files seem to come with these data. how do I store them in the raw object?
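
For context on item 1, this is roughly what the EEG part looks like with the standard montage API (a minimal, self-contained sketch; the channel positions and fiducials are placeholder values, not from a real Curry file):

```python
import numpy as np
import mne

# Placeholder positions in meters, head coordinate frame (assuming the
# Curry file supplies metric xyz positions per channel).
ch_pos = {
    "Fp1": np.array([-0.03, 0.08, 0.04]),
    "Fp2": np.array([0.03, 0.08, 0.04]),
}
montage = mne.channels.make_dig_montage(
    ch_pos=ch_pos,
    nasion=[0.0, 0.10, 0.0],
    lpa=[-0.08, 0.0, 0.0],
    rpa=[0.08, 0.0, 0.0],
    coord_frame="head",
)

# Dummy Raw standing in for the object the reader constructs.
info = mne.create_info(["Fp1", "Fp2"], sfreq=1000.0, ch_types="eeg")
raw = mne.io.RawArray(np.zeros((2, 1000)), info)
raw.set_montage(montage)  # attaches the digitization to info["dig"]
```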

a few other things to discuss:

  • preload
    easiest would be not to offer preload=False and just load the data into memory.
    a single load_data() call would also be doable with the official reader, but a chunked reader not really (if we don't want to hack it, e.g. load all data and discard large parts). I'm not sure I'm deep enough into the MNE codebase to know what the implications are (e.g. computations, plots etc. with unloaded data).
    what are your thoughts?

  • epoched files
    the reader code looks as if there could be files with epoched recordings, but there are none among their sample files. do any of you know more about this? otherwise I'll ask the Curry devs

@dominikwelke (Contributor, Author)

p.s. could you remind me how to switch off the CIs when pushing these early commits?

@larsoner (Member) commented Apr 3, 2025

Push commits with [ci skip] in the commit message and the long / expensive CIs shouldn't run (a few quick ones still will, I think)

@larsoner (Member) commented Apr 3, 2025

... for the cHPI stuff it's probably easiest to load a Neuromag raw FIF file with cHPI info and look at how the info is stored for example in info["hpi_subsystem"]. You can also look at the Info docs, especially the notes. It's not complete but it will help.
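
For example (the file path is a placeholder; any Neuromag recording with cHPI info will do):

```python
import mne

# Placeholder path to a Neuromag raw FIF recording with cHPI info.
raw = mne.io.read_raw_fif("neuromag_chpi_raw.fif")
print(raw.info["hpi_subsystem"])  # how the cHPI metadata are laid out
print(raw.info["hpi_meas"])       # HPI measurement/fit information
```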

For preload, since preload=False is in main it would be a functionality regression to remove it. Once you know how the data are stored on disk and how to read an arbitrary time slice from it, it's really not bad to do the work to make preload=False work. So if you can figure this part out in some function, I can help you fit it into the _read_segment_file code. Since the .py file is only a few hundred lines (a lot of which seems like plotting etc. that we won't use), I'm cautiously optimistic we can figure it out and make it work. And then the curry-python-reader code can really be for reading metadata, annotations, sensor locs, etc., plus knowing where to read the data from disk. We can probably even keep the existing _read_segment_file, it should work in theory...
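
To sketch the shape of that work (assuming the Curry samples are stored as a flat little-endian float32 block following a header; `_read_segments_file` is a private MNE helper and the `data_offset` key is hypothetical):

```python
from mne.io import BaseRaw
from mne.io.utils import _read_segments_file  # private helper


class RawCurrySketch(BaseRaw):
    """Skeleton only -- not this PR's actual code."""

    def _read_segment_file(self, data, idx, fi, start, stop, cals, mult):
        # BaseRaw calls this lazily for an arbitrary slice [start, stop),
        # which is what makes preload=False work. For a flat float32 data
        # block the generic helper does the seeking/scaling; ``offset`` is
        # the byte size of the header preceding the samples.
        _read_segments_file(
            self, data, idx, fi, start, stop, cals, mult,
            dtype="<f4", offset=self._raw_extras[fi]["data_offset"],
        )
```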

@dominikwelke (Contributor, Author) commented Apr 14, 2025

ok, _read_segment_file does indeed work unchanged.
the reader should now be more or less functional

  • I'd still need some guidance on handling/storing the channel locations, esp. for MEG data
  • HPI data - looks like I got it wrong - there might not be cHPI data after all, only HPI marker locations, provided in different formats depending on the system

@dominikwelke (Contributor, Author)

@CurryKaiser
thanks for the permission to use the code, also from my side!

in another place you said you might be able to provide us with test files - could we perhaps get a small one with epoched recordings in it (format version shouldn't matter)?
your repository for the python reader contains some test files that the reader interprets as epoched, but they don't really seem to be (perhaps the files were truncated for size)

@CurryKaiser

Could be that they were truncated, let me check.

@CurryKaiser

Ok, try these:
EpochedData

@dominikwelke (Contributor, Author)

thanks for the file @CurryKaiser
fyi, we have now packaged and published curryreader on PyPI.
it can be installed via pip install curryreader

@dominikwelke (Contributor, Author)

@drammock @larsoner @agramfort
it is on PyPI but not on conda-forge - how is this case dealt with in MNE? should we also submit it to conda-forge?

currently pip install mne[full] fetches it, but conda env create --file environment.yml doesn't

related question:
which pip dependency level in pyproject.toml should this go into? I treated curryreader like the antio package (for ANT Neuro files), but this makes it an optional requirement (in mne[full]). I believe this means it won't be automatically installed when calling pip install mne?
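
For what it's worth, optional reader dependencies are typically imported lazily inside the reader, so a plain pip install mne works until someone actually reads a Curry file. A generic sketch of that pattern (the error-message wording is illustrative, not MNE's actual helper):

```python
def read_raw_curry(fname):
    # Import lazily so 'curryreader' is only required when actually used.
    try:
        import curryreader
    except ImportError:
        raise ImportError(
            "the 'curryreader' package is required to read Curry files; "
            "install it via 'pip install curryreader' or "
            "'pip install mne[full]'"
        ) from None
    ...  # parse with curryreader and build the Raw object
```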

@larsoner (Member)

it is on PyPI but not on conda-forge - how is this case dealt with in MNE? should we also submit it to conda forge?

Yeah use grayskull, it's not too painful, see for example conda-forge/staged-recipes#28279

@dominikwelke (Contributor, Author)

Yeah use grayskull,

see conda-forge PR: conda-forge/staged-recipes#29754

@dominikwelke (Contributor, Author)

I had some deadlines and will now be on vacation for a while. I'll continue working on the PR afterward

@drammock (Member)

users are asking for this; xref to forum post: https://mne.discourse.group/t/mne-io-read-raw-curry-for-curry-9/11245/2

@dominikwelke (Contributor, Author)

users are asking for this; xref to forum post: https://mne.discourse.group/t/mne-io-read-raw-curry-for-curry-9/11245/2

good to hear. I just came back from annual leave and can start working on it again

@dominikwelke (Contributor, Author)

hi @larsoner @drammock -
this is now doing what it should, and it would be great to get some more in-depth review!
thanks in advance for your feedback :)

specific questions:

  • sensor digitization:
    could someone with a better understanding of how MNE stores these have a look?
    curry files can store EEG locations and MEG locations, all as metric xyz coordinates (I think). especially in the case of MEG I'm not 100% confident that I set all FIFF flags correctly, or whether there are transformation steps I missed.
    what I implemented partly copies the previous MNE reader version, and a 3D plot of sensor locations (EEG+MEG) looks OK.

  • tests:
    feel free to suggest improvements, or extensions to better cover my code.

  • API 1:
    curry files can store continuous or epoched data. for epoched data, I default to returning an instance of Epochs but offer an option to return a Raw with annotations (import_epochs_as_annotations=False).
    good like this, or should I change the default / rename the argument?
    (ps. maybe it can also store evoked data - the curryreader returns the number of averages per epoch, which suggests this might be the case, but I don't have example data for this, so I currently raise NotImplementedError. @CurryKaiser: can you tell us more about this? perhaps provide a small example file with evoked/averaged recordings?)

  • API 2:
    apart from the reader read_raw_curry I also added functions to read the EEG montage (read_dig_curry) and impedance measurements (read_impedances_curry). do I need to add them somewhere in the docs?
    the latter could also be included in https://mne.tools/stable/auto_examples/io/read_impedances.html, I guess?

  • potential privacy issue:
    curry files can include amplifier/device information. I currently just dump this into device_info.type.
    in some example files I saw that this info can be very long and include serial numbers and paths to local files.
    as I didn't find any spec for the Curry format, it's not easy to split the string meaningfully, to store the identifiable parts in the device_info fields that are overwritten by MNE's anonymization functions.
    how should I proceed with this?
    @CurryKaiser: is there a fixed structure to the information stored in the AmplifierInfo field that I could use?
    example:

Synamps MEG Headbox 1: Digital(PN:8509 SN:0029) Analog(PN:9227 SN:0020) FW:01 -- Headbox 2: Digital(PN:8509 SN:0063) Analog(PN:9227 SN:0059) FW:01 -- Headbox 3: Digital(PN:8509 SN:0026) Analog(PN:9227 SN:0045) FW:01 -- Headbox 4: Digital(PN:8509 SN:0019) Analog(PN:8548 SN:0012) FW:01 -- Headbox 5: Digital(PN:8509 SN:0078) Analog(PN:8548 SN:0070) FW:01 (SynAmps MEG), LP-Filter: order 2 Butterworth (Configuration: 186 MEG + Quik-Cap Net 128 01_23_2020) || MEG: H1 conf: E:\Curry_Configs\BNI_A_parameters20200220.cfg (Thu Feb 20 17:21:42 2020¶) | H1 sens: ./OrionMEG/MEG_186_system_CURRY_10092019.cfg (Wed Sep 11 02:57:42 2019¶) | H1 geom: ./OrionMEG/MEG_186_10092019 (Mon Sep 30 06:35:12 2019¶)

@dominikwelke (Contributor, Author) commented Jul 21, 2025

btw, the style issue says: mne/decoding/base.py:272: unused attribute 'required' (60% confidence)
doesn't seem to be me?

@larsoner (Member)

sensor digitization:

Yeah I can look a bit deeper... before I do, what's the situation on main? Are sensor positions loaded when you read_raw_curry?

The short version in case you want to think about if things are done correctly:

  1. EEG-like sensors are stored in head coord frame, i.e., the right-handed RAS coordinate frame formed by LPA, RPA, and Nasion.
  2. MEG-like sensor positions are stored in the MEG coord frame, which has a device-dependent origin (and up/orientation) but is otherwise RAS.
  3. info["dev_head_t"] stores the coordinate transformation between the head and MEG coordinate frame. This should be non-None for recordings with humans.

Is this how things work in this PR (and on main)?
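
To make item 3 concrete, the transform is an mne.transforms.Transform; a minimal sketch (the identity matrix is a placeholder, a real recording would carry the measured device-to-head transform):

```python
import numpy as np
import mne

# Placeholder: identity device-to-head transform (real data would use the
# one measured via HPI/fiducials during acquisition).
dev_head_t = mne.transforms.Transform("meg", "head", np.eye(4))
print(dev_head_t)
# Inside a reader this ends up in info["dev_head_t"] (Info is normally
# locked, so readers set it while constructing the object).
```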

[read_impedances_curry] could also be included in https://mne.tools/stable/auto_examples/io/read_impedances.html, i guess?

Sure!

store this identifiable stuff in the device_info items that are overwritten by MNEs anonymization functions.

Seems reasonable to me

curry files can store continuous or epoched data. for epoched data, i default to return an instance of Epochs but offer option to return a Raw with annotations (import_epochs_as_annotations=False).

Why have an option to import as raw? To me this seems like a more general problem, for which we might want an Epochs method epochs.as_raw() that does the wrapping back to a 2D array/structure with annotations or whatever. Can we live without this for now and see if people end up wanting it?
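
A rough sketch of what such a hypothetical epochs.as_raw() could do (it ignores baseline correction, epoch overlap, and the gaps between epochs in the original recording):

```python
import numpy as np
import mne


def epochs_to_raw(epochs):
    """Hypothetical helper: flatten Epochs back into an annotated Raw."""
    data = epochs.get_data()  # shape (n_epochs, n_channels, n_times)
    n_epochs, _, n_times = data.shape
    # Concatenate epochs along time into a 2D array.
    raw = mne.io.RawArray(np.hstack(data), epochs.info.copy())
    # One annotation per epoch, laid out back-to-back.
    sfreq = epochs.info["sfreq"]
    onsets = np.arange(n_epochs) * n_times / sfreq
    id_to_name = {v: k for k, v in epochs.event_id.items()}
    descriptions = [id_to_name[code] for code in epochs.events[:, 2]]
    raw.set_annotations(
        mne.Annotations(onsets, n_times / sfreq, descriptions)
    )
    return raw
```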

@larsoner (Member)

btw, the style issue says: mne/decoding/base.py:272: unused attribute 'required' (60% confidence)

Only tangentially related... vulture uses heuristics, and I think because you removed the required from some function in the curry code, it now sees code in mne/decoding/base.py with a .required as being unused anywhere 🤷 To work around this, you should just be able to add _.required to the vulture_allowlist.py and be good

@dominikwelke (Contributor, Author)

curry files can store continuous or epoched data. for epoched data, i default to return an instance of Epochs but offer option to return a Raw with annotations (import_epochs_as_annotations=False).

Why have an option to import as raw? To me this seems like a more general problem that we might want to have an Epochs method epochs.as_raw() that does the wrapping back to a 2D array/structure with annotations or whatever. Can we live without this for now and see if people end up wanting it?

I can remove the option, no problem.

@dominikwelke (Contributor, Author) commented Jul 21, 2025

store this identifiable stuff in the device_info items that are overwritten by MNEs anonymization functions.

Seems reasonable to me

the point was: it's not straightforward to parse and split the string ;)
fwiw, there could be anything in there

a few options are:

  1. leaving it as is, leading to cases with incomplete anonymization
  2. acting as if there were structure, and splitting the strings based on the limited examples I saw
  3. ignoring this info and leaving device_info empty
  4. dumping everything into one of the device_info fields that is overwritten, instead of device_info.type

@dominikwelke (Contributor, Author)

sensor digitization:

Yeah I can look a bit deeper... before I do, what's the situation on main? Are sensor positions loaded when you read_raw_curry?

yes, sensor locations are also loaded in the legacy version. as I said, I reuse parts of this code.

in the PR I do set the coordinate frames as intended (head for EEG, device for MEG), and for EEG I'm generally quite confident everything is fine (including RPA, LPA, and nasion).
for MEG not so much - I did set a head-dev transform, but I'd like feedback. thanks for having a look!

@dominikwelke (Contributor, Author)

@larsoner -
vulture is indeed happy now.
a new issue arose in check_neuromag2ft

Successfully merging this pull request may close these issues: MEG data problem consulting